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Abstract 

A systematic review of research on oral language assessments for four-to-eight-year- 
old children was undertaken to support a six-year action research project aimed toward 
co-creating classroom oral language assessment tools with teachers in northern rural 
and Indigenous Canadian communities. Through an extensive screening process, 10 
studies were assessed as highly rated and identified for inclusion in the final review. 
Narrative, vocabulary, and syntax assessments were the most common assessment 
types found in the final review. Assessment practices in all studies in the final review 
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involved gathering language samples in one-on-one adult-directed contexts. The sys¬ 
tematic review also revealed that a preponderance of the research on young children’s 
oral language assessment has been published in speech-language pathology and lan¬ 
guage testing journals. Although educational researchers recognize the importance of 
oral language to children’s literacy and learning, there is a paucity of research on oral 
language assessment conducted by educational researchers and published in educational 
research journals. Implications to policy and classroom practice include recommenda¬ 
tions for increased research collaboration between speech-language pathology research¬ 
ers and literacy researchers along with input from early childhood educators to develop 
oral language assessment instruments that support children’s oral language in classroom 
settings. 

Keywords: oral language, systematic review, early childhood education, young children, 
classroom assessment 


L’evaluation de la langue orale des jeunes enfants : des recom- 
mandations pour les pratiques pedagogiques et pour les 

politiques 


Resume 

Une revue systematique des evaluations de la langue orale chez les enfants de quatre a 
huit ans a ete entreprise en vue d’etayer un projet de recherche-action visant a co-creer 
des outils d’evaluation du langage parle en classe avec des enseignants dans des commu- 
nautes autochtones et dans des communautes rurales dans le nord du Canada. A la suite 
d’un processus de depistage exhaustif, dix etudes ont recu une excellente cote et ete rete¬ 
nues pour la synthese finale. Les types d’evaluations les plus courantes, telles que reper- 
toriees dans la synthese finale, portaient sur les competences narratives, le vocabulaire 
et la syntaxe. Les pratiques devaluation dans toutes les etudes retenues dans la synthese 
finale reposaient sur la collecte d’echantillons de langage dans des echanges interper¬ 
sonnels diriges par un adulte. II est ressorti de la revue systematique que les recherches 
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sur 1’evaluation de la langue orale chez les jeunes enfants sont surtout publiees dans des 
revues axees sur l’orthophonie et revaluation des connaissances linguistiques. Bien que 
les chercheurs en education reconnaissent l’importance du langage parle pour la litteratie 
et l’apprentissage durant l’enfance, il y a peu de recherches sur 1’evaluation de la langue 
orale rnenees par des chercheurs en education et publiees dans des revues centrees sur 
la recherche en education. Sur le plan des politiques et des pratiques pedagogiques, 
il y aurait done lieu que se multiplient des recherches concertees entre les chercheurs 
en orthophonie et les chercheurs en litteratie et qu’on fasse appel aux educateurs de la 
petite enfance pour mettre au point des outils d’evaluation du langage parle afin de favo- 
riser le developpement de la langue orale chez les enfants en classe. 

Mots-cles : langue orale, revue systematique, education de la petite enfance, jeunes 
enfants, evaluation en classe 
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Introduction 

In this article we draw on the results of a systematic review of research evaluating young 
children’s oral language assessment to identify principles and practices for classrooms 
and make recommendations for policy development. We begin with the research-based 
and curriculum-supported assumption that oral language is foundational to literacy and 
to all learning. Children’s oral vocabulary, storytelling performance, and phonological 
awareness at school entry are predictors of later reading comprehension and decoding 
(Dickinson & Porche, 2010; Resnick & Snow, 2009). In addition to using oral language 
to express ideas and intentions in communication with others, children use language to 
organize and reflect on experience (Alexander, 2006; Bames, 1992; Mercer & Littleton, 
2007; Wells, 1999; Vygotsky, 1978). Putting words together to communicate with oth¬ 
ers helps children to clarify and deepen understanding. Oral language is foundational to 
social, as well as conceptual, learning. Children are socialized into the cultural practices 
of their communities through their everyday interactions with others (Cekaite, Blum- 
Kulka, Grover, & Teubal, 2014). Children’s choice of words, sounds and structures, and 
the ways in which they use these features of language to achieve their own purposes, are 
some of the indicators of their social learning. 

The importance of oral language to literacy and all learning is described in many 
language arts curricula in Canada (e.g., Alberta Education, 2000; Manitoba Education, 
2011; Ontario Ministry of Education, 2006; Saskatchewan Ministry of Education, 2010) 
and around the world (Department for Education, 2013; Ministere de l’education natio¬ 
nal, de Tenseignement superieur et de la recherche, 2016). In Ontario, for example, the 
Language curriculum document (Ontario Ministry of Education, 2006) contains an asser¬ 
tion that “oral communication skills are fundamental to the development of literacy and 
essential for thinking and learning” (p. 9). As such, supporting children’s oral language, 
in part through carrying out ongoing, observational formative assessments, should be an 
integral part of classroom practice (Fisher & Frey, 2007). Student learning is enhanced 
when teachers gather and analyze infonnation about students’ learning throughout the 
school year, using it to guide their teaching (Bennett, 2011; Black & Wiliam, 2009; Hon- 
drich, Hertel, Adl-Amini, & Kleime, 2015). 

Given the importance of oral language to literacy and all learning in school and 
beyond, there is a need for educators and educational researchers to devote time and 
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resources to understanding effective ways of assessing and supporting children’s oral 
language in classrooms. This is the premise underlying a six-year action research project 
aimed toward co-creating culturally relevant (Ladson-Billings, 1995) classroom oral lan¬ 
guage assessment and instructional tools and approaches with teachers of four-to eight- 
year-old children in northern rural and Indigenous Canadian communities. Participants 
are Indigenous and non-Indigenous kindergarten and Grade 1 teachers and their students 
in rural northern communities in four provinces. In an ongoing cycle of data collection, 
analysis, and implementation, teachers and researchers video-record children’s play inter¬ 
actions. Based on collaborative analysis of the recordings, we plan together pedagogical 
and assessment approaches that teachers then implement to support children’s language 
learning and development. 

To inform the action research, we conducted a systematic review of research on 
oral language assessment published between 1980 and 2015. We identified research-sup¬ 
ported characteristics of oral language assessment practices for developing assessment 
tools and practices appropriate for primary classrooms whose makeup differs from a Eu- 
ropean-heritage urban/suburban middle-class demographic. The question guiding the sys¬ 
tematic review was: What practices and tools are effective for assessing young children’s 
oral language? We focused on the following aspects of the research on oral language 
assessment because they have greatest relevance to classroom practice, as identified by 
teachers participating in our previous interview research (Peterson, McIntyre, & Forsyth, 
2016): 

1. features of children’s oral language that are assessed in the research studies; 

2. practices for gathering and analyzing oral language of young children; and 

3. considerations for classroom assessment of young children’s oral language. 

Following a description of our systematic review process, we discuss the oral language 
assessment features, practices, and considerations that are found in the identified research 
studies in terms of their potential to inform classroom practice and policy decisions. We 
draw on the literature in the literacy education field and our ongoing action research to 
support our recommendations. 
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Methods 

In 2015, a literacy researcher/teacher educator and a language researcher/speech language 
pathologist developed the systematic review protocol to guide a team of five graduate 
student researchers through the subsequent search and screening process. All phases of 
the review were guided by the Preferred Reporting Items for Systematic Reviews and 
Meta-Analyses (PRISMA) (Moher, Liberati, Tetzlaflf, Altman, & The PRISMA Group, 
2009), which detail the essential elements of a review through a comprehensive check¬ 
list and flow diagram (see prisma-statement.org for further infonnation). The two lead 
researchers agreed on a definition of oral language as consisting of three major compo¬ 
nents: language form (e.g., phonology, morphology, syntax), semantics, and pragmatics/ 
communicative competence (Owens, 2012). Children demonstrate communicative com¬ 
petence when they use language appropriately in various social contexts (Hymes, 1966). 
Oral language is viewed as both “a code whereby ideas about the world are represented 
through a conventional system of arbitrary signals for communication” (Bloom & Lahey, 
1978) and as an expression of “culture and [ways] to express intentions in congruence 
with the culture” (Bruner, 1986, p. 65). Thus, in addition to learning vocabulary, speech 
sounds, and ways of putting words and phrases together, children learn possible roles, 
relationships, perspectives, meanings, and modes of meaning available within their social 
worlds as they learn language (Genishi & Dyson, 2009; Vygotsky, 1978). 

With this sociolinguistic view of language (Bruner, 1986; Vygotsky, 1978) as the 
conceptual framework for our systematic review, we selected three online bibliographic 
databases to conduct the search: ERIC, ProQuest Education, and PsycINFO. In the initial 
phase, two graduate student researchers searched the targeted databases using the core 
search terms: oral language assessment and oral communication assessment. During 
the second electronic search phase, five graduate student researchers searched the same 
electronic databases using a core search tenn and a sub-term. Examples of sub-terms 
include language knowledge, expressive language, language structure, lexical develop¬ 
ment, emerging language, and functional language, among others. For example, a student 
would search for oral language assessment together with expressive language. A third 
non-electronic phase consisted of physical searches of five peer-reviewed journals related 
to early childhood education and oral language development, targeting journals that the 
initial search phase had shown were likely to contain oral language assessment-focused 
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research. One graduate student researcher conducted physical searches in the American 
Journal of Speech Language Pathology, Language, Speech and Hearing Services in 
Schools', Journal of Research in Childhood Education:, Communication Disorders Quar¬ 
terly, and Applied Psycholinguistics following the completion of the electronic searches. 

Screening: Applying Inclusion/Exclusion Criteria 

Guided by the goal of infonning classroom teachers in contexts such as the northern 
Canadian rural and Indigenous classrooms involved in our action research, our configura- 
tive systematic review (Oliver, Dickson, & Newman, 2012) included research that evalu¬ 
ated the effectiveness of particular assessment tools and approaches across a range of cul¬ 
tural contexts. Primary research, published in English from 1980 to 2015, and conducted 
with participants aged four to eight years of age, was targeted for inclusion (Gough & 
Thomas, 2012) in the review. The decision was made to begin the search in 1980 to cover 
a sufficient span of time to capture the types of oral language assessments that may be 
used with young children, while acknowledging that searching further into the past was 
unlikely to be productive to our review. Participant age ranges were permitted to overlap 
with our targeted age range (e.g., a study with participants from ages two years to six 
years would be included because the participant ages crossed over with the target age 
range for the review). A list of key word descriptors related to oral language assessment 
was also applied as an inclusion criterion (e.g., language use, emergent language, devel¬ 
oping language, speech development, expressive language, receptive language, etc.). 

The review protocol and PRISMA work-flow diagram (Moher et ah, 2009) were 
used to screen over 30,000 articles found in the initial search of the two core search 
terms: oral language assessment and oral communication assessment. The language 
researcher/speech pathologist trained two graduate student researchers on the systematic 
review process and these graduate student researchers trained an additional three grad¬ 
uate student researchers. The team of five graduate student researchers exported search 
lists from the three databases, which included article titles, abstracts, publication dates, 
and keywords, using each core search term (e.g., oral language assessment) plus sub- 
term (e.g., expressive language), into Excel files. Following removal of duplicate entries, 
titles and abstracts were reviewed in Excel and inclusion criteria were applied (primary 
research, participant age, publication date, publication language, key words). Where 
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insufficient information was available in the title and abstract (e.g., research participants’ 
ages), the foil article was consulted for the relevant information. In this screening pro¬ 
cess, we identified approximately 1,200 articles for potential inclusion in the review. 

The number of articles meeting the existing inclusion criteria called for a honing of the 
criteria which would better represent the intent of the systematic review’s guiding ques¬ 
tion—to gain a more comprehensive understanding of effective children’s oral language 
assessment tools and practices. 

In a team meeting attended by the five graduate student researchers and the litera¬ 
cy researcher/teacher educator, two additional inclusion criteria were generated. Graduate 
student researchers had noted that much of the identified research in the initial search 
phase targeted non-typically developing children (e.g., hearing-impaired children, chil¬ 
dren with special identified needs), bilingual children, or children learning an additional 
language. They also identified a group of studies that used oral language assessments to 
assess the effectiveness of interventions or were designed to detennine potential rela¬ 
tionships between results of oral language and literacy (primarily reading) assessment 
results. These research studies did not directly address our research question, which 
focused specifically on practices and tools that would inform classroom assessment of 
young children’s oral language. As a result, during the next iteration of the search process 
(Brunton, Stansfield, & Thomas, 2012), we excluded them in the next round of screening. 
We reasoned that studies primarily interested in the relationship between reading and oral 
language development, while important to literacy development, fell outside the scope 
of our research question in that these studies had a primary focus on reading and not oral 
language. 

The second phase of screening, conducted by the five graduate student research¬ 
ers, resulted in 201 non-excluded studies. Two graduate student researchers subsequent¬ 
ly read the resulting 201 studies in their entirety to verify the presence of the inclusion 
criteria (primary research, participant age, publication date, publication language, key 
words) and relevancy to the research question. These two graduate student researchers 
then met to discuss their independent assessments of the 201 articles using the inclusion 
and relevance criteria. They agreed upon 33 studies as meeting all inclusion criteria and 
being relevant to the research question. 
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Final Screening Process 

The final set of 33 research studies was evaluated by applying a research quality and 
research relevance assessment (Gough, 2007). Research quality referred to evaluations of 
validity (internal, external, and construct) and research relevance related to fidelity to the 
review’s research question, including whether language form, content, and use had been 
addressed within the study. The literacy researcher/teacher educator also reviewed the 
final set of 33 research studies for quality and research relevance following the graduate 
students’ assessments, which provided an additional layer of validation to the graduate 
student researchers’ assessments. All three researchers agreed on the final assessments 
for the last set of 33 studies. Ten of the 33 studies resulted in a high overall rating, 18 
studies achieved a medium rating, and five studies received a low overall rating. A study 
receiving a high rating represented high relevancy to the research question, high levels of 
internal, external, and construct validity, and addressed at least two of the three language 
elements targeted (fonn, content or meaning, and use). Only highly rated studies were 
included in the final synthesis. Table 1 provides detailed information about the language 
features assessed, participant information, and the journals in which each of the 10 identi¬ 
fied studies were published. 
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Table 1 . High-rated studies used in discussion 


Citation 

Journal 

Participant 

Age 

Methodology 

Target Demographic 

# of 

Participants 

Language Fea¬ 
tures Assessed 

Bradfield, Besner, 
Wackerle-Hollman, 
Albano, Rodriquez, & 
McConnell (2014) 

Assessment for 
Effective Interven¬ 
tion 

Pilot - 36 to 

71 months 
Main - 48 to 
71 months 

6 language assessments measured 
expressive and receptive vocabu¬ 
lary; correlations calculated across 
all assessments 

Preschool children 

Pilot - 55 

Main - 756 

Vocabulary 

Craig & Washington 
(2002) 

American Journal 
of Speech-Lan¬ 
guage Pathology 

47 to 78 
months 

Spontaneous language samples 
scored for linguistic features (e.g., 
utterance length, complex syntax); 
scores analyzed using ANOVA and 
regression modeling 

African American children 
speaking African American 
English; 

Low and Middle income SES 

100 

Vocabulary 

Craig, Washington, & 
Thompson (2005) 

American Journal 
of Speech-Lan¬ 
guage Pathology 

Grade 1-5 

Picture description task and 
comprehension tasks analyzed for 
linguistic features; results analyzed 
using M ANOVA 

African American children 
speaking African American 
English; 

Low and Middle income SES 

295 

Vocabulary 

Heilmann, Miller, & 
Nockerts (2010) 

Language Testing 

60 to 84 
months 

Narrative retell task scored with 
Narrative Scoring Scheme (NSS); 
scores analyzed through descrip¬ 
tive statistics, including skewness, 
kurtosis and examiner judgement 

Typically developing children 

129 

Narrative features 

Hipfner-Boucher, Mil- 
burn, Weitzman, Green¬ 
berg, Pelletier, & Girol- 
ametto (2014) 

first Language 

46 to 71 
months 

Assessment battery of 10 tests 
included 2 narrative retell tasks and 

1 story generation task; scores an¬ 
alyzed through correlation, linear 
regression modeling 

English speaking children 

89 

Narrative features 

Patton Terry, Mills, 

Language, Speech, 

55 to 71 

Narrative elicitations scored using 

African American children 

146 

Syntax 

Bingham, Mansour, & 
Marencin (2013) 

and Hearing Ser¬ 
vices in Schools 

months 

NSS and analyzed for syntax struc¬ 
tures; results correlated across tasks 

speaking African American 
English 


Narrative perfor¬ 
mance 
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Citation 

Journal 

Participant 

Age 

Methodology 

Target Demographic 

# of 

Participants 

Language Fea¬ 
tures Assessed 

Qi, Kaiser, Milan, & Han¬ 
cock (2006) 

Language, Speech 
& Hearing Services 
in Schools 

36 to 54 
months 

4 tests administered to establish 
convergent validity; scores an¬ 
alyzed through correlation and 
regression analysis 

African American and Euro¬ 
pean American children in 
same community; 

Low income SES 

482 

Vocabulary 

Qi & Mar ley (2011) 

Topics in Early 
Childhood Special 
Education 

34 to 62 
months 

Battery of tests to validate Pre¬ 
school Language Scale for target 
population; results analyzed 
through MANOVA, regression 
analysis, item response theory 

Hispanic American and Euro¬ 
pean American children 

440 

Vocabulary 

Smith, Myers-Jennings, & 
Coleman (2000) 

Communication 
Disorders Quar¬ 
terly 

36 to 71 
months 

4 tests administered to target 
population; scores analyzed using 
ANOVA, descriptive statistics 

Rural children 

African American and Cau¬ 
casian 

middle income SES 

160 

Communicative 

competence 

Schneider & Vis Dube 
(2005) 

American Journal 
of Speech-Lan¬ 
guage Pathology 

62 to 82 
months 

3 story retell tasks scored using 
Systematic Analysis of Language 
Transcripts; results analyzed 
through ANOVA, t tests 

Typically developing English 
speaking children 

44 

Narrative fea¬ 
tures; Syntax 


The final 10 highly rated studies represented a wide range of assessment types and target participants. To mine the studies for their 
insights into oral language assessment, the same two graduate student researchers applied an inductive coding process (Glaser, 1978) to the 
studies to detennine patterns in features of the children’s oral language that were assessed, the assessment practices employed, and consider¬ 
ations for classroom assessment of young children’s oral language. 

Before discussing the results of our systematic review, we present limitations to our search process that must be considered when 
applying our findings to classroom practice and policy development. In the extensive process of having five research assistants screening 
the large number of studies initially identified (over 30,000), it is possible that studies qualifying for full review may have been missed. The 
length of time taken to conduct the review means that studies published since the review began have not been included and we acknowledge 


Canadian Journal of Education / Revue canadienne de T education 40:3 (2017) 

www.cje-rce.ca 






Assessing Young Children s Oral Language 


373 


that there may be more recent research that would be relevant to our research question 
and meet our inclusion criteria. Additionally, we acknowledge that the systematic review 
was undertaken to inform our own action research project with kindergarten and Grade 
1 teachers and children in northern Canadian rural and Indigenous communities. Our 
research goals and sociocultural theoretical lens have influenced our choice of databases 
and inclusion criteria, as well as our evaluation of the 201 studies that were identified in 
the second screening. Because children in our action research study speak and represent 
diverse English dialects, we focused on the literature within these domains. The wealth 
of research on bilingual children’s language learning and development indicates that 
an additional review focusing on this body of the literature is warranted. We must also 
acknowledge that the primary or sole language of all the researchers involved in the re¬ 
view is English, thus limiting the inclusion of articles to those published in English while 
understanding that we live and conduct research in a country (Canada) with two official 
languages and where a multitude of languages are in use. 

One observation of note is that the oral language assessment research identified in 
our search has not been published to a great degree in early childhood education and liter¬ 
acy journals, nor in classroom-based assessment journals. We discuss this finding later in 
our implications for policy section, as well as a discussion of implications for classroom 
practice that arise from our systematic review of the literature. First we present the results 
of the review. 


Results 

As shown in Table 1, narrative comprehension, syntax, and vocabulary were the most 
commonly assessed oral language features in the final set of research studies identified 
in our systematic review process. Communicative competence was assessed as one of a 
number of language features in one of the identified studies. These features demonstrate 
direct correspondence with our initial three-pronged definition of oral language: language 
form (e.g., vocabulary and syntax), semantics (e.g., narrative comprehension), and prag¬ 
matics or communicative competence. We use the assessed features of oral language— 
narrative, syntax, vocabulary, and communicative competence—as organizational 
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categories for reporting the results of the review, followed by a brief discussion of oral 
language assessment practices. 

Narrative Assessments 

Researchers in four of the 10 research studies identified in our systematic review (Heil¬ 
mann, Miller, & Nockerts, 2010; Hipfner-Boucher et ah, 2014; Patton Terry et ah, 2013; 
Schneider & Vis Dube, 2005) gathered language samples by inviting children to narrate a 
story while looking at wordless picture books. These studies’ practices and assumptions 
align with social constructivist views of language and learning to some degree. The use of 
narrative, for example, fits with Bruner’s (1990) argument that narrative structures pro¬ 
vide frames that help children both construct meaning and communicate their understand¬ 
ing of that meaning. Children’s oral narrative retellings of stories from wordless picture 
books, comprised of sentences tied together by themes and narrative structures, provide 
a more comprehensive picture of their language and thinking, than do standardized tests 
that evaluate limited sets of skills that have been stripped of their communicative context 
(Heilmann, Miller, & Nockerts, 2010). Researchers examining children’s oral narratives 
recognize that a number of factors will influence children’s retellings of stories, including 
whether and how many times they have previously heard the story (Martinez & Roser, 
1985), their familiarity with the activities that characters undertake in story events (Hud¬ 
son, Shapiro, McCabe, & Peterson, 1991), the complexity of the story (Heilmann, Miller, 
& Nockerts, 2010), and the adult input (directions and expectations) when eliciting the 
narrative retelling (DeTemple, Wu, & Snow, 1991). 

Researchers conducting two of the narrative-focused studies (Hipfner-Boucher 
et ah, 2014; Schneider & Vis Dube, 2005) examined episodic structure, number of char¬ 
acters identified, initiating event, internal response, and the setting, attempt, and conse¬ 
quence of story events in children’s narratives to examine individual structures within the 
narratives (Schneider & Vis Dube, 2005) or potential relationships between narratives and 
phonological awareness (Hipfner-Boucher et al., 2014). Schneider and Vis Dube (2005) 
found that children in Grade 2 produced significantly more story grammar elements 
than children in kindergarten when stories were presented orally (either with or without 
pictures) by a researcher for retelling by the child in the assessment. The picture-only 
presentation did not result in a significant difference in the number of story of elements 
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produced by the children based on age/grade level. Hipfner-Boucher and colleagues 
(2014) combined narrative assessment with a battery of standardized tests and found 
strong correlations between narrative story structure and phonological awareness. Their 
results support emphasizing the importance of assessing children on narrative abilities as 
well as vocabulary skills and phonological awareness. 

Heilmann, Miller, and Nockerts (2010) assessed the appropriateness of four 
scoring schemes for analyzing 129 five- to seven-year-old children’s oral narratives. 

They used the Narrative Scoring Scheme (NSS) (developed by the Language Analysis 
Lab at the University of Wisconsin-Madison) to examine the macrostructure of children’s 
narratives (e.g., use of story grammar—providing infonnation about the introduction, 
character development and mental states and intentions, conflict/resolution, and con¬ 
clusions, in addition to sequencing events accurately and providing transitions between 
events). The other three assessments were (1) a procedure focusing on plot and theme 
(Reilly, Losh, Bellugi, & Wulfeck, 2004), which assesses setting, instantiation (providing 
concrete examples), episodes, and resolution; (2) an ordinal adaptation of Applebee’s 
Narrative Maturity Scale (Manhardt & Rescorla, 2002), which assesses narratives holis¬ 
tically, placing them on a 5-point scale ranging from a “Heap” of events with no central 
theme, to a “True narrative” with well-developed events and central theme or moral; and 
(3) an ordinal adaptation of Stein’s story levels (Pearce, McCormack, & James, 2003), in 
which an 11-point continuum ranges from an isolated description of random characters 
and actions to interactive episodes taking multiple perspectives of multiple characters and 
showing how their goals mutually influence each other (Heilmann, Miller, & Nockerts, 
2010, pp. 621-622). The NSS achieved a wider, but nonnal, distribution of scores (com¬ 
pared to the other measures) and no child achieved “mastery” (p. 612) (a score over 90%) 
on the measure. Results from each of the other three measures found over one-third of 
participants scoring over 90%, indicating a ceiling effect for the other measures. The NSS 
includes examiner judgement, and thus does not strictly rely on numerical outcomes on 
the discrete elements of assessment and “tap[s] into the perceptual elements of the narra¬ 
tive process” (p. 615). 

Patton Terry and colleagues’ (2013) narrative assessment research examined rela¬ 
tionships between participating children’s use of their spoken African American English 
dialect, narrative macrostructure, and story comprehension. They developed performance 
descriptions of pre-kindergarten African American children’s narrative performance from 
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the fall to the spring of one school year. They found that their participants performed 
within age-appropriate expectations on three measures used in the study (Narrative As¬ 
sessment Protocol, as noted in Pence, Justice, & Gosse, 2007; Narrative Scoring Scheme, 
as noted in Heilmann, Miller, & Nockerts, 2010; Heilmann, Miller, Nockerts, & Dun¬ 
away, 2010; High Point Analysis, as noted in McCabe, Bliss, Barra, & Bennett, 2008) 
and, that as children’s narratives became more complex throughout the school year, the 
scores on two of the assessments significantly improved. These analysis schemes provide 
an abundance of narrative features and ways to assess young children’s narrative compre¬ 
hension for classroom use. 

Viewed through a sociolinguistic lens, the oral narrative assessments in our 
identified studies overlook a critical component of storytelling, however, as they do not 
take into account cultural differences in narrative styles. As Gutierrez-Clellen and Quinn 
(1993) explain, “Storytelling is a social event governed by cultural norms and values. 
These extralinguistic rules dictate appropriate narrative behavior” (p. 4). In classroom 
practice, it is important to consider the ways in which narratives are structured according 
to cultural expectations within the children’s communities when developing schemes for 
assessing children’s oral narratives (Melzi, Schick, & Kennedy, 2011; Michaels, 1981; 
Ochs & Capps, 2001; Peltier, 2010; Piquemal, 2003). The notion of narratives having a 
story grammar consisting of a setting, problem or initiating event, attempts to solve the 
problem, and eventual problem resolution is not a universal view of how narratives are 
structured. Assuming that children’s narrative retellings will conform to the dominant 
narrative structures creates a biased assessment context favouring children who have 
grown up hearing and reading narratives with these structures. 

Syntax Assessments 

Researchers who have published their work in literacy journals have found that children’s 
oral narratives are useful sources of infonnation about children’s story comprehension 
(e.g., Morrow, 1990; van Kraayenoord & Paris, 1996). The studies identified in our 
systematic review, published predominantly in speech-language pathology and language 
testing journals, expand the assessment focus when children’s oral narratives are the data 
sources. They show how the narratives can provide information about children’s use of 
syntax, a language feature that correlates with children’s overall language growth (Craig 
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& Washington, 1994), as well as their later reading achievement (Craig, Connor, & Wash¬ 
ington, 2003). 

In one of the identified studies (Patton Terry et ah, 2013), children’s narrative 
samples were used to examine language productivity (e.g., number of words and number 
of utterances), language complexity (e.g., use of cohesive devices, such as conjunctions, 
clause density, and number of complex structures in each utterance), and accuracy (e.g., 
number of grammatically correct utterances and adequacy of cohesive devices used in 
the retelling). Children’s use of syntactic structures in their oral language was an assessed 
feature of oral language in four of the 10 identified studies in the final review (Craig & 
Washington, 2002; Craig, Washington, & Thompson, 2005; Patton Terry et ah, 2013; 
Smith, Myers-Jennings, & Coleman, 2000). 

The results of three of the 10 identified studies underscore the need for teachers 
to consider local English dialects when assessing the syntax of children’s oral language 
(Craig & Washington, 2002; Craig et ah, 2005; Patton Terry et ah, 2013). The impetus 
for these studies was research showing that “many typically developing African Ameri¬ 
can students fall below published nonnative means on standardized tests that are based 
primarily on mainstream students” (Craig et ah, 2005, p. 121), as well as research show¬ 
ing that children from higher socio-economic communities have performed better than 
children from lower socio-economic communities (Craig & Washington, 2002). Qi and 
Marley (2011) explain in their study that the nonning procedures, selection of test items, 
and the interaction patterns used to gather language samples contribute to findings of cul¬ 
tural bias in standardized tests. Many oral language assessments have been nonned with 
children using Mainstream American English (MAE). The test items require that chil¬ 
dren have experience with typical European American heritage middle-income families’ 
lifestyles, and are familiar with interaction patterns that require labelling or pointing to 
objects. As such, the performance on standardized oral language assessments of children 
who speak English dialects other than MAE may reflect language and cultural experi¬ 
ences that differ from children used to norm the assessments, rather than true language 
delays. 

When the syntax of African American English was recognized in the analysis 
of children’s free-play utterances and narrative retellings, Craig and Washington (2002) 
found that African American children performed within age-appropriate expectations. 
These findings contrast with results of standardized tests that have been nonned using 
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Mainstream American English showing consistent patterns of underachievement of Afri¬ 
can American children (Qi et al., 2006; Qi & Marley, 2011; Smith et ah, 2000). It is ap¬ 
parent that designing linguistically and culturally appropriate assessments should involve 
consultation with community members about communicative practices and dialect with¬ 
in the community. Research studies in our identified set of 10 (e.g., Smith et al., 2000) 
provide a model for considering dialectal differences when scoring children’s responses 
to prompts. Smith and colleagues (2000) found that two tests {Test of Language Develop¬ 
ment-2 Primary [TOLD-2P] and Test of Pragmatic Skills ) showed little to no difference 
between their participants’ results and the nonned group, indicating these tests may be 
culturally valid for a wide range of children. However, the Test of Auditory Comprehen¬ 
sion of Language—Revised (TACL-R) scores were significantly lower for the participant 
children compared to the nonned group, which may indicate a potential for bias. Scores 
on the Patterned Elicitation Syntax Test (PEST) were also significantly lower for the 
participant group compared to the nonned group. However, once scores were adjusted for 
dialectal differences, the number of participants “classified as language disordered was 
reduced by more than two thirds” (Smith et al., 2000, p. 105). 

Vocabulary Assessments 

Given the predominance of vocabulary assessments in the oral language assessment 
research identified in our systematic review (five of the final 10), together with the abun¬ 
dance of research showing correlations between vocabulary and reading comprehension 
(Dickinson & Tabors, 2001) and between vocabulary and children’s abilities to compre¬ 
hend what others say and to communicate with others (Marchman & Fernald, 2008), it 
is clear that vocabulary should be a component of classroom oral language assessment 
practices. 

The identified studies that used the Peabody Picture Vocabulary Test (PPVT-3 or 
4) (Dunn & Dunn, 2007) and the Preschool Language Scale (PLS-3 or 4) (Zimmerman, 
Steiner, & Pond, 2002) included those that attempted to establish a criterion or valida¬ 
tion measure for correlational purposes when testing and validating a newly designed 
vocabulary assessment {Picture Naming task in Bradheld et al., 2014) or other fonns of 
assessments (Craig et al., 2005; Patton Terry et al., 2013), or to examine the validity of 
standardized tests across varied demographic populations (Qi et al., 2006; Qi & Marley, 
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2011). Other vocabulary assessments that involve pointing to pictures representing target 
vocabulary, such as Test of Auditory Comprehension of Language-Revised (TACL-R; 
Carrow-Woolfolk, 1985) and the Test of Language Development-2 Primary (TOLD-2; 
Shulman, 1986) were used by Smith and colleagues (2000) to determine the effect of 
linguistic variation in English on rural preschool children’s language. The Test of Pre¬ 
school Early Literacy (TOPEL; Lonigan, Wagner, Torgesen, & Rashotte, 2007) was 
used alongside the PPVT-4 in Patton Terry and colleagues’ (2013) study of correlations 
between narrative performance, oral expressive and receptive vocabulary, spoken dialect 
use, and story comprehension. The actual tests that were used and the administration 
processes carried out in these vocabulary-oriented studies are not as readily applicable to 
classroom settings as the narrative-oriented studies. For example, in the identified studies, 
an adult, usually a researcher or speech language pathologist, assessed individual chil¬ 
dren’s receptive language by asking them to point to pictures of objects and actions that 
the adult named. Children’s expressive language was assessed when they named objects 
and actions in pictures. However, given that vocabulary is best learned when children 
hear and use words in meaningful contexts (Neuman & Dwyer, 2009) and when children 
are interactive and responsive in social situations, rather than passive listeners of others’ 
language (Harris, Michnick Golinkoff, & Hirsh-Pasek, 2011), it follows that classroom 
assessments should also take place in meaningful interactive contexts. Inviting children 
to name or point to pictures of objects and actions removes their use of vocabulary from 
actual communicative experiences where there are social purposes for the language. 

In two of the studies, researchers targeting vocabulary assessment chose alter¬ 
native instruments or methods to administer their assessments. Bradfield and colleagues 
(2014) went to great lengths to create lists of words familiar to children of the target age 
when developing oral vocabulary assessments by consulting three different word-list 
sources (e.g., Biemiller, 2009; Dale & Fenson, 1996; Hart & Risley, 1999) and eight pub¬ 
lished preschool curricula to develop an appropriate word list for their Picture Naming 
assessment. Only one of the identified 10 research studies (Craig & Washington, 2002) 
involved the recording of spontaneous language samples in 15-20 minute segments 
while children played in pairs with action figures, dolls, and a toy school house. Craig 
and Washington assessed children’s expressive vocabulary by calculating the number of 
different words that children expressed, average clause length, and measures of syntactic 
complexity in the recorded segments. 
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Communicative Competence Assessment 

Fundamental to oral language assessment is gathering information about children’s com¬ 
municative competence; information that shows how children achieve social intentions 
across a wide range of situations encountered in daily life (Genishi & Dyson, 2009; Ochs 
& Capps, 2001). Children’s communicative intent and use of language to achieve social 
purposes comprised a small piece of one of the final set of 10 studies. In this study (Smith 
et ah, 2000), the researchers engaged children in four dyadic-conversations. They asked 
questions to elicit 10 categories of communicative intentions (e.g., greeting, requesting, 
informing, rejecting, reasoning, closing the conversation) using the Test of Pragmatic 
Skills (Shulman, 1986). Smith and colleagues (2000) explain that adult-child conversa¬ 
tions used to assess children’s communicative intent are meant to provide a “more natural 
conversational context” (p. 103). 

While it is true that children are conversational participants in this assessment and 
that the language samples are more authentic than those of the standardized vocabulary 
assessments, the flow of conversation is somewhat contrived in an effort to elicit partic¬ 
ular communicative intentions that align with the assessment protocol. Furthermore, this 
rather restrictive communicative context provides limited information about children’s 
use of language to achieve social intentions, an assessment purpose aligned with the 
sociolinguistic theoretical approach (Halliday, 1975; Vygotsky, 1978) underpinning our 
systematic review. 

Assessment Practices 

As an extension of the communicative competence findings, a discussion of the assess¬ 
ment practices enlisted by researchers in the final 10 studies of the review requires atten¬ 
tion. The most frequently used assessment practice involved adults asking an individual 
child to respond to oral language input related to pictures. This practice was evident in 
all 10 of the studies included in the final synthesis. For example, in the administration of 
the PPVT-4 for receptive vocabulary and TOLD-2 Primary test of semantic, morpholog¬ 
ical, and syntactical receptive language skills, an adult presents pictures of four common 
objects and actions, asking the child to point to one of them. In a similar manner, assess¬ 
ments of children’s expressive vocabulary, grammatical skills, and spoken dialect use 
(e.g., PPVT-4; Expressive One Word Picture Vocabulary Test—III, Picture Naming Test; 
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TOLD-2; TACL-R) ask children to name objects and actions in pictures or repeat stimuli 
provided for pictures. Tests of oral comprehension (Craig & Washington, 2002; Craig 
et ah, 2005) involved an adult asking “Wh” questions about activity pictures and asking 
children to point to pictures showing the actions in sentences with active and passive 
constructions that are spoken aloud by the adult. 

Children’s narrative language samples were elicited from the wordless picture 
book, Frog, Where Are You? (Mayer, 1969) in three studies (Heilmann, Miller, & Nock- 
erts, 2010; Hipfner-Boucher et ah, 2014; Patton Terry et ah, 2013). The adult examiner 
read the story aloud to the child, inviting the child to follow along by looking at the 
pictures, and then asked the child to retell the story. Schneider and Vis Dube (2005) 
presented three stories that they had developed from the pictures of one book (Oops by 
Mercer Mayer, 1977) to typically developing, English-speaking kindergarten and Grade 
2 children in their study. The stories were then presented orally, orally with pictures, or 
with pictures only to each child by a researcher, and the child retold the story to a differ¬ 
ent researcher. 

Adult-child conversations used to assess children’s communicative intent are 
meant to provide a “more natural conversational context” (Smith et ah, 2000, p. 103), 
though it is the adults that select conversation topics and direct the flow of conversation 
in an effort to elicit particular communicative intentions in children’s language. 


Summary 

Our review of oral language assessments for four-to eight-year-old children revealed 
a diverse body of assessments administered for an array of testing purposes. The use 
of narrative assessments focused attention on the story grammar knowledge of young 
children (i.e., narrative macrostructures) (Heilmann, Miller, & Nockerts, 2010; Hipf¬ 
ner-Boucher et ah, 2014; Patton Terry et ah, 2013; Schneider & Vis Dube, 2005), syn¬ 
tactic structures (i.e., narrative microstructures) (Craig & Washington, 2002; Craig et ah, 
2005; Patton Terry et ah, 2013), and in one instance, dialect (Craig & Washington, 2002). 
Vocabulary assessments, such as the PPVT and the PLS, were used, not only to directly 
assess oral language (Qi et al., 2006; Qi & Marley, 2011), but also as validation instru¬ 
ments for other vocabulary assessments (Bradfield et al., 2014), for narrative assessments 
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(Patton Terry et al., 2013), and for others addressing syntax (Craig et ah, 2005). Less 
diversity was observed in the types of assessment practices of the identified studies for 
review. Nine of the 10 studies involved adult-directed assessment practices conducted 
by researchers or speech language pathologists. An acknowledgment of the importance 
in studying underrepresented populations was clear, as six of the 10 studies specifically 
targeted underrepresented ethnic, language, or socio-economic communities (Craig et al., 
2002; Craig & Washington, 2005; Patton Terry et al., 2013; Qi et al., 2006; Qi & Marley, 
2011; Smith et al., 2000). These studies demonstrate some of the limitations of adminis¬ 
tering standardized tests without considering a child’s local dialect and/or socio-economic 
context. 

Our systematic review of research has revealed that researchers in the speech-lan¬ 
guage pathology field are far more likely to take up research questions related to oral 
language assessment, than are educational researchers. Despite educational researchers’ 
recognition of the importance of oral language to children’s learning and literacy, there 
is a paucity of classroom-based educational research and resources for teachers on the 
topic of oral language assessment. As shown through the results of the systematic review, 
the oral language assessment research findings have much to offer classroom teachers in 
terms of the content of oral language assessment (e.g., vocabulary, narrative structures, 
syntactic structures, and communicative competence), including a recognition of the 
syntax of children speaking diverse English dialects. The criterion-referenced assess¬ 
ments conducted in the identified research studies (e.g., analysis of children’s narrative 
retellings and of their spontaneous free-play interactions) provide the needed flexibility 
to capture the complexities of children’s language across linguistic and cultural contexts. 
Given that the research findings of studies highlighted in our systematic review address 
research purposes specific to speech-language pathology and special education, however, 
it is important to integrate the findings with those of research having a classroom focus. 
The integration of the two fields can enrich teachers’ classroom practices to support chil¬ 
dren’s oral language development. 
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Implications 

Our systematic review of oral language assessment research provides implications for 
policy development, as well as classroom practice, including that arising from our action 
research. Given that most of the 201 studies that were relevant to our search and all of the 
10 articles on the final list have been published in journals of speech language pathology, 
communication disorders, language or psychoeducational testing, special education, and 
assessment for intervention, it is apparent that the fields of speech language pathology 
and special education have much to offer classroom teachers in their assessment of young 
children’s oral language. 

For Policy 

Many structures are in place that separate researchers and practitioners in the two 
fields, making it difficult for each to inform the other. For example, the preparation of 
speech-language pathologists often takes place in science faculties and speech-language 
assessment research is generally funded from science research envelopes, whereas 
teacher preparation takes place in social sciences faculties and classroom assessment 
research funds generally come from social science-oriented sources. In our Canadian 
province, funding for speech-language services provided for children comes from either 
the Ministry of Health, Ministry of Community and Social Services, or the Ministry of 
Education, depending on the nature of individual children’s speech and language needs, 
and on their age (Ontario Ministry of Education, 2016). Indeed, it is only through activ¬ 
ities related to our action research project (e.g., carrying out this systematic review and 
being contacted by speech-language pathologists working in the Indigenous communities 
that are part of our project to collaborate in designing dynamic assessment tools appropri¬ 
ate for Indigenous children) that we literacy researchers/teacher-educators have become 
aware of the healthy overlap, in terms of the goals of the two fields, and the considerable 
expertise and knowledge that each field can offer the other. To our knowledge, the collab¬ 
orative research in which speech-language pathologists and we literacy researchers are 
engaging is rather unique. 

An immediate implication for policy, thus, is to create forums for teachers, lit¬ 
eracy educators, and speech language pathologists to collaborate and learn from each 
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other in order to support all children’s oral language in classroom and speech-language 
service settings. Additionally, both fields would benefit from collaborative research that 
integrates research purposes relevant to the work of speech-language pathologists and 
classroom teachers, and draws on the expertise of each. The fundamental nature of oral 
language to children’s learning and literacy should be recognized in policies guiding 
teacher education programs and classroom assessment, as well as the provision of speech 
and language services in schools. 

For Practice 

Our experience in working with kindergarten and Grade 1 teachers and with speech-lan¬ 
guage pathologists as part of our action research shows that both groups of professionals, 
who work directly with young children, recognize the importance of oral language and 
have observed the need for a focus on supporting oral language in classroom instruction 
and in research. The findings of the systematic review show that oral language assessment 
research using children’s oral narratives has direct application to classroom practice, both 
in the methods used to gather language samples and in the features of children’s language 
that are assessed. However, through interviews with 36 primary teachers, early child¬ 
hood educators, and consultants in northern communities in four Canadian provinces, our 
action research project has, so far, highlighted the need for a framework to guide class¬ 
room assessment of children’s oral language. Participants in our action research project 
describe being underprepared to assess and support children’s oral language because 
of a lack of attention on oral language in teacher education programs, and a dearth of 
resources available to teachers (Peterson, McIntyre, & Forsyth, 2016). Teachers partici¬ 
pating in our action research tell us that it is not practical for teachers in classrooms of 20 
or more children to assess children’s use of syntax to the degree that researchers do in a 
one-time, one-on-one gathering of language samples. Additionally, classroom language 
assessments are most helpful to teachers when they provide information about children’s 
response to as wide a range as possible of communicative demands that are placed on 
them in their everyday lives (Barnes, 1992; Halliday, 1975). 

Together with our participant teachers, we are drawing on the oral language as¬ 
sessment research identified in our systematic review to create a manageable classroom 
assessment scheme that will provide useful information about children’s language to 
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inforrn teaching practice. Specifically, we are working on approaches to assessing chil¬ 
dren’s language growth in terms of the length of children’s utterances, as well as their use 
of conjunctions and clauses, in interactions while engaged in dramatic and construction 
play. Sociodramatic play is recommended as a context for supporting (Nicolopoulou, 
McDowell, & Brockmeyer, 2006) and assessing children’s expressive vocabulary, as we 
are finding, where our participant children have used language for 35 specific purposes in 
their play. In sociodramatic play, they most frequently used language for the purpose of 
imagining, and in construction play, they used language most frequently for learning and 
to satisfy their own needs. Participating teachers have co-created a classroom assessment 
tool (Assessing Children’s Use of Language—ACUL) to record and analyze individual 
children’s use of language in play and other contexts where children interact with others 
in the classroom (Portier & Peterson, 2017). 

The professional learning that we all have gained through drawing on each oth¬ 
er’s knowledge and experience shows us, on a small scale, the benefits of collaboration 
between two fields that have traditionally been separate, yet have much in common as we 
draw on assessment information to support young children’s oral language. 
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